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METHOD AND APPARATUS FOR COMPRESSING COMPUT ED 
TOMOGRAPHY RAW PROJECTION DAT A 

BACKGROU3SD AND SUMMARY OF THE INVENTION: 
Computed Tomography (CT) 

Computed Tomography (CT) is a diagnostic imaging modaUty that produces 
5 three dimensional (3D) images of the hody by making multiple x-ray exposures at 
different locations around the patient. The basic system comprising a CT scanner as 
well known in the art includes a scanning gantry, x-ray generator, computer system, 
viewing and operator consoles, and a hard copy camera, all as shown in fig. 11. The 
scanning gantry as well known in the art contains an x-ray tube, collimator, detector 

10 array, and associated data acquisition electronics. The CT scanning process involves 
collecting raw projection data by means of projecting highly collimated x-rays across 
a patient from different orientations. From these raw projection data (the sinogram), 
the internal structure of the object can be computed by various reconstruction schemes 
[Kak & Slaney], Because of its medical utility, the use of CT imaging continues to 

15 grow, and with it the amount of patient data that must be handled, transmitted and 
stored. 

Advances in CT technology 

Recent advances in technology, such as hehcal or spiral scanning with slip- 
ring design [Crawford] and multirow detector technology, enable faster scanning of 

20 more patients and larger volumes for each patient. A slip-ring technology allows 
continuous rotation of the x-ray tube and detector array assembly. Spiral CT has 
reduced scan times and allowed for continuous data acquisition for volumetric 
medical imaging. Because the raw data are acquired in a continuous fashion in spiral 
CT, any transverse sUce can be specified for reconstruction at any longitudinal 

25 interval of scanning. These images generated with a small reconstruction increment 
provide ova-lapping image slices and thus improve both for hi^- and low-contrast 
resolution pCalender et al.]. Improving longitudinal resolution due to overlapping 
reconstruction is important for voltimetric imaging, as the in-plane resolution is 
usually much higher (on the order of 5-10) than the longitudinal resolution. For 

30 example, high-resolution CT images of the cochlea are obtained by scanning the 
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cochlea and temporal bone with 1 mm collimation but reconstracting images at every 
0.1 mm. In general, a higher image resolution provides a better diagnosis and finer 
assessment of small structures such as a stenosis in a vessel. Thin slice volume 
scanning is particularly useful for virtual colonoscopy and CT angiography. Thus, 
5 there is a trend to conduct higher resolution tomography which increases the amount 
of data to be handled and stored. 

A recent development in technology is the introduction of multirow detector CT 
scanners. Instead of the 1 -dimensional detectors, these scanners use 2-dimensional 
detectors that consist of several arrays of detectors. Thus, when using a scanner 
10 having n detector arrays, the volume scan time T to obtain the same amoimt of data is 
reduced to roughly T/n. In addition, the x-ray tube's output can be more efficiently 
used. 

Increase in CT data 

Data voliimes have increased tremendously with new faster and finer CT 

15 scanners. In order to handle the large data volimies, advances in computer technology 
and storage devices are necessary. In addition, advances in digital technology and 
algorithm development aUow interactive and more thorough display of patient image 
data by physicians and radiologists, for example in Picture Archiving and 
Commimication Systems (PACS) or with Computer Aided Diagnosis (CAD). Thus, 

20 the volume of CT digital data that must be accommodated is constantly increasing, 
and expected to continue increasing. In ihe prior art, CT images were printed on laser 
fihn, viewed on a Ught box, and stored in fihn folders in a fihn library. However, the 
costs for manual labor, propensity to misplace films, and need to distribute images to 
multiple caregivers in different locations has motivated the development of PACS. A 

25 serious challenge to implementing PACS is the amount of data tiiat must be 

accommodated. CT images typically consist of 512 by 512 pixels, with 12 bits of 
gray scale resolution, so each image is about 0.5MB (4,000,000 bits) in size. There 
may be 100-200 images for each patient study, so the complete data set may reach 
more than 100MB. Because of this large size, digital data sets rq)resenting images 

30 are usually only temporarily saved, with the ultimate record of patient images being 
the physical film copy. However, the advent of digital systems is making the storage 
of digital image data more compelling. Also, as computers become less expensive 
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and more powerful, it is possible to have distributed workstations to reconstruct 
images from raw data for diagnostic viewing. In this case, large data files must be 
transmitted across the network, requiring waits of long duration. Decreasing the size 
of files to be transmitted would increase system performance. 
5 Advantages of storing raw data rather than image data 

One of the important features of spiral CT is that retrospective reconstruction 
can be performed. Spiral raw data are collected at first and stored instead of 
reconstructed images. Then, these raw data can be recalled later to generate images 
with different reconstruction increments depending on clinical applications. 

10 Similarly, in a multirow spiral CT, images with different slice thickness can be 

generated firom the same set of raw data, depending on clinical applications. Thus, 
there is a need in the art, and a perceived benefit, for storing raw data as it allows a 
medical professional to revisit not only the same images previously used (by saving 
the particular iadex identifying the views then reviewed), but also to consider the 

15 additional input of views not previously considered which can be fireshly constructed 
firom the raw data. This can help a medical professional giving a second opinion, the 
same medical professional who might want to reconsider his original opinion, or even 
to possibly do other diagnoses from data not previously considered or relating to the 
prior medical symptom. However, despite these added benefits of having original raw 

20 data, present practice is to save only the image data and the raw data are deleted at the 
time of image data archival, as the raw data is typically 3-6 times larger than the 
image data for a given slice coverage. For example, for a chest CT examination to 
cover 30 cm with 1 mm coUimation and 1.5 table increment, the size of raw data is 
400 MB, while the size of 60 images with 5 mm shce thickness is 30 MB. Thus, the 

25 trend in the prior art is to focus on starting with the image data and then to work on 
finding better ways to save or store it as the overwhelming crush of image data alone 
is generally considered monumental so that those of skill in the art don't seek to 
compound the problem by choosing an inherently larger data set to begin with. 
Prior Art Image Data Compression 

30 Methods considered in the prior art for reducing the amount of data to be 

handled and stored include image data compression; using transformations and 
encoding to represent image information with fewer bits [Rabbani and Jones]. Many 
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schemes have been attempted to do this [Foos]. One class of algorithms, called 
lossless, produces no change in the final data. However, because of the information 
content (entropy) of the image, compression ratios achieved by this technique are 
typically in the range of 2-3 to 1 and are thus of limited value. Techniques that allow 
5 differences between the original and reconstructed compressed image can reach much 
higher compression ratios, with more compression generally resulting in higher 
distortion of the final image upon recall. Studies have shown that recalled CT images 
with compression ratios in the range of 8:1 have observable distortions, which may 
affect the confidence of the interpretation by radiologists. In the present legal and 

10 medical environment, such levels of compression are thus not practical as in many 
cases the true value of image storage is to acciurately reproduce what the medical 
professional made reference to when the initial diagnosis was made. Introducing 
artifacts into the recalled image interferes with this purpose which may even defeat 
the value of storing any particular image. 

1 5 Under present prior art medical practice, the original raw proj ection data that 

is used to reconstruct image sUces is routinely kept for only a day or so, in case 
further views are needed for diagnosis. The size of the projection data can be quite 
large, as each measurement involves approximately 1000 detectors, with tens of 
thousands of measurements across the volume, resulting in data sizes of 100MB per 

20 patient. In fact, the size of the projection data is usually as large or larger than the 
total reconstructed image shoes. Therefore, it is not normally saved with the digital 
image data for a typical cUnical examination except for special research purposes. 

Thus there is a long felt need to more effectively manage digital data for CT 
imaging in general, and a teaching in the prior art that compressing image digital data 

25 is the only option to reduce the size of the data to be saved. 
BRIEF DESCRIPTION OF THE DRAWINGS: 

Fig. la&b are images of a non-spiral abdominal CT image, and its sinogram; 
Fig. 2a&b are images of a sinogram compressed by 12:1 firom fig. lb, and a 
difference image representing the differences between fig. lb and 2a; 

30 Fig. 3a&b are an image reconstructed fi-om the 12:1 compressed data of fig. 

2a, and an original image compressed with 12:1 ratio; 
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Fig. 4a&b are images showiag differences between the original and fig. 3a, 
and differences between original and fig. 3b; 

Fig. 5a&b are head images witiiout compression, reconstructed from 12: 1 
sinogram compressed data, and compressed with 12:1 compression ratio; 
5 Fig. 6a&b are difference images between the original and the 12:1 sinogram 

- compressed data set, and the original and the 12:1 compressed image data set; 

Fig. 7a&b are a head image reconstructed from a 23:1 sinogram compressed 
data set, and a 20:1 direct compression; 

Fig. 8a&b are a spiral chest image with a nodule, and its corresponding 
10 sinogram; 

Fig. 9a, b &c are spiral chest images without compression, reconstructed from 
a 12:1 sinogram compressed data set, and compressed with 12:1 compression ratio; 

Fig. 10 is a block diagram of a CT scanner detailing the data flows for the data 
collected and processed in a typical CT scan; and 
15 Fig. 1 1 is a diagram of a CT scanner. 

Fig. 12 is a Head CT image , its original and compressed sinograms, and 
difference between the sinograms. 

Fig. 13 is a compressed spiral lung CT image displayed in (A, B) lung and (C, 
D) soft-tissue window settings. 
20 DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT: 

This invention involves an improved method to reduce the amount of digital 
data required for creating CT images for diagnostic purposes. The inventors have 
discovered that by compressing the raw digital projection data, its size can be 
efficiently reduced while preserving useful diagnostic information sufficient to 
25 reconstruct not only the original im£iges but also new views c^tured within the 

projection data. Having the original projection data available provides the additional 
benefit of allowing additional viewing and processing at a later date such as might be 
beneficial for legal purposes, or even to provide for second opinions or for other 
diagnostic purposes. 

30 The raw projection data is sometimes termed a "sinogram" in the industry, 

because the data appears as a set of sinusoidal paths during the scan. This is caused 
by the superposition of objects in the transmitted x-ray beams and results in a 
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sinogram "image" with a highly correlated structure. This correlation of the 
projection data allows certain compression schemes to efficiently reduce the amoimt 
of stored data. The sinogram can correspond to a stationary scan of a single plane 
(leading to a "step and repeat" acquisition process for the full volume) or the 
5 continuous movement of the object during the scan ("helical" or "spiral" CT). To 

generate high quahty images, many samples of the projection beam are required: there 
are typically 1000 detectors that may each take 1000 measurements per complete 
revolution of the x-ray source. To cover the desired volume, there may be 10-40 
revolutions required, resulting in hundreds of megabytes of projection data per patient 
10 scan. 

Reconstruction processes, such as filtered back projection or simultaneous 
algebraic reconstruction, produce planar image shoes fi-om the sinogram data. While 
the sinogram contains information about all objects in its field of view (FOV» on the 
order of 50 cm), typically only a small area is of interest, so after a preview a subset 

15 region of interest is reconstructed and ultimately saved. Thus, schemes that compress 
images may have the advantage that smaller, more correlated data sets are 
addressed^ — but at the ejqjense of loss of information about the whole view of the 
patient and altemative views of the area of interest. 

As an example of the present inventive technique, CT data sets were acquired 

20 from a Siemens Somatom Plus 4 Scanner. Sinograms consist of either 768 or 1536 
detectors with 1252 samples per gantry revolution. It should be noted, althougih well 
known to those in the art, that each CT manufacturer provides its sinograms written in 
a proprietary format, so some hmited amount of decoding is required to arrive at the 
true projection data. For instance, the data file usually has a header section, wherein 

25 text or binary information is stored that records the patient name, time of scan, 
hospital location, etc. This information must be preserved without loss, so it is 
handled separately. (Usually it is small in size, such as 4096 or 8192 bytes in length.) 
Also, there may be digital information mixed in with the scan signal measurements, 
such as data recording the table position, gantry angle position, amplifier gain factor, 

30 etc., of each measurement line. Again, the size of this information is small (tens of 
bytes per gantry position) and must be preserved without loss, so it is separated and 
saved without compression. The sample data was extracted and processed into units 
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representing the logarithm of the ratio of the xraattenuated x-ray beam to the 
transmitted x-ray beam. However, other data representations may be used as is 
convenient to the particular CT scanner data set output. This data set was then 
compressed using the JPEG-LS software [Weinberger, ISO/TEC 14995], which uses a 
5 low complexity pattem generation and run-lengfli encoding scheme to represent 
information- The amount of compression is controlled by specifying the maximum 
allowable error between an original pixel value and the reconstructed one, e.g., error = 
2 means that all pixels in the final image will be within 2 counts of their original 
value. By specifying an error of 0, the compression will be lossless. By specifying 

10 tiie allowable error to be some multiple of the intrinsic noise in the CT acquisition 
. process, the error introduced will be on the order of the underlying physical noise and 
therefore be minimally objectionable [Yoimg, et al.]. For sinogram data, the prime 
noise source is the quantum statistics of the x-ray photons. The signal-to-noise ratio 
increases as the square root of the number of photons detected. Since the signals are 

1 5 encoded in attenuation (logarithmic) space, the noise associated at any level will be a 
constant times one-half the attenuation number. Thus, the preferred embodiment is to 
transform the stored attenuation code value by quantizing it in steps proportional to 
some hnear fraction of its value. The proportionality constant is determined by 
measuring the standard deviation of the signal level at some particular set of the 

20 scanner parameters of collimation, rotation time and tube current. The product of 
these three determine the x-ray flux that is sensed by the detector. Therefore from a 
measxirement of the standard deviation of the signal at single scanner combination, 
one scales the quantization step by the ratio of the product of the three parameters. 
For example, if the standard deviation in a uniform region of the sinogram signal (at 

25 value 10000 code values) is 10 code values for scanner parameters of Smm 

collimation, 150 mA tube current and gantry rotation time 750ms, the quantization 
step at signal level 5000 would be 5 code values. If another scan was made on the 
same scanner with 300mA, the quantization step for code value 10000 would be 
reduced to 5 code values. Encoding with this quantization scheme gives the minimum 

30 entropy for the code values, which are then compressed as above. When the 

compressed data is needed for image reconstruction, e.g., after retrieval from a storage 
device or transmission across a network, the compressed signal data is reconstituted 
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by an inverse mapping of the compressor (and quantizer if necessary), the digital data 
(header and information words) are added in their original location, and the data file is 
ready for computation. 

The sample sinograms were compressed by varying amoimts and then returned 
5 to the CT console for reconstruction. Also, images reconstructed firom the original 
sinogram were compressed by varying amounts and decompressed for comparison. It 
was found that the compression ratio for a given specified error depended greatly on 
tiie scoae being compressed, so the program was varied to achieve comparable 
compression ratios for image comparison. Observation of the images determined that 

10 compression ratios of less than 15:1, and preferably between 7:1 and 10:1, gave 
clinically useful images, while higher compression ratios compromised diagnostic 
utility. At comparable compression ratios, images firom the compressed sinograms 
were similar in quality to images that had been directly compressed, even though the 
compressed images covered only Vt. to Va, of the total area of the sinogram. Artifacts 

15 introduced by compressing sinograms tended to be of lower spatial frequency, and 
therefore degraded high contrast regions less. These images and related difference 
images and other data are shown in Fig. 1 to 9 appended hereto. 

Still another set of examples are reported as follows. Clinical diagnostic 
images of chest, abdomen, and head were collected along with their corresponding 

20 raw projection data (sinograms) from a Somatom Plus-4 scanner (Siemens Medical 
Systems, Iselin, NJ). The chest projection scan was obtained from a patient witii a 
spiculated lung nodule using a high-resolution spiral CT technique (collimation 2 mm 
and pitch 1). The abdomen scan was acquired using a standard non-spiral technique 
with 8 mm collimation to evaluate for a questionable Uver mass, while the head scan 

25 was acquired using the same non-spiral technique to evaluate for a severe headache. 
These scans were diagnosed normal. It was our intention to test the compression 
technique in both spiral and non-spiral scans. In our data set, the choice of spiral or 
non-spiral CT scans for a body part was arbifrary and based on the availability of 
scans at the time of data collection. The collected CT images and their sinograms 

30 were electronically transferred to a workstation. 

Sinograms consisted of either 768 or 1536 detectors with 1252 samples per 
gantry revolution. The data were extracted and processed into units representing the 
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logarithm of the ratio of the imattenuated x-ray beam to the transmitted x-ray beam. 
Data compression was performed xising the JPEG-LS implementation (LOCO, 
Hewlett-Packard, Palo Alto CA), which uses a low complexity pattem generation and 
run-length encoding scheme for data representation. The program executed rapidly, 
5 encoding or decoding at a speed of approximately 1 megabytes/sec. 

Sinograms were compressed and decompressed, and then transferred back to 
the CT scanner console for image reconstruction. Also, images reconstructed from the 
original sinogram were compressed by compression ratios comparable to the 
sinogram compression and decompressed. It was found that the compression ratio for 

10 a given specified error depended on the details of the scanned obj ect being 

compressed, therefore, program parameters were varied to achieve comparable 
compression ratios for image comparison. 

Images from compressed sinograms wCTe compared with those from direct 
image compression. For each of the three body types, image sets (consisting of an 

15 original, images compressed at three levels, and images reconstructed with three 

levels of compression) were viewed on the scanner console workstation. The quahty 
of these images was directly compared using a side-by-side display. No systematic or 
blinded analysis to grade the image quality was attempted. Qualitative properties 
examined included low level contrast in homogeneous portions of image, edge 

20 distortion, and artifact visibility. In addition, a difference image between the original 
and compressed images was generated for each image data and studied for correlated 
structure or artifacts. The root mean square error (RMSE) of the difference, which is a 
commonly used engineering metric for image degradation, was computed for a 
quantitative measure. 

25 RESULTS 

A non-spiral abdominal CT image, its original and 12:1 compressed 
sinograms, and the difference of these sinograms are shown in Figures 1 and 2. The x- 
coordinate of the sinogram represents the loci of the x-ray tube, while the y- 
coordinate represents the loci of the detectors. Each horizontal line of the sinogram 

30 represents transmitted x-ray beam signals recorded at each detector during one 

revolution of the CT gantry. The bright, white-cloud area of the sinogram corresponds 
to the abdomen cross-section, while the peripheral dark area corresponds the 
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surrounding air. Although it is difficult to distinguish the compressed from the 
original sinograms by visual inspection, the difference is the most prominent at the 
boundaries of the abdomen, the CT table, and air because of the homogeneous texture 
of unattenuated x-rays in air. 
5 An image reconstructed from the 12:1 compressed sinogram in Figures 1 and 

2 and an image with 12:1 direct compression are shown in Figures 3 A and 3B, 
respectively. Difference images were generated between the original and the 
sinogram-compression images and between the original and the direct-compression 
images, shown in Figure 4A and 4B, respectively. These compressed abdominal 

10 images were not easily discernible from the original image in Figures lA. Compared 
to the sinogram-compression image, however, the direct-compression image 
demonstrated a sUght increase in low-level white noise, particularly in homogenous 
areas of the subcutaneous fat and the hver. This increase in low-level noise is clearly 
demonstrated qualitatively in the difference images and quantitatively from the 

15 computation of the root mean square errors (RMSE) of the difference images. The 
RMSE that quantifies the magnitude of noise was 10.62 and 13.47 in the difference 
images from the sinogram-compression and the direct-compression, respectively. No 
high-level structural noise or artifacts were introduced from the compression. 

A non-spiral head CT image, its original and 12:1 compressed sinograms, and 

20 the difference between these sinograms are shown in Figure 12. The cross-sectional 
area of the head is smsdler than that of the abdomen, resulting in a larger area of air 
space in surrounding the head in the CT scan. This large air space is clearly delineated 
in the sinograms. An image reconstructed from the compressed sinogram in Figure 12 
and an image with 12:1 direct compression are shown in Figures 5B and 5C, 

25 respectively. Although these images were difficult to discern from the original, the 
direct-compression image showed more increase in low-level white noise than the 
sinogram-compression image. This increase in noise was confirmed in the difference 
images, as shown in Figures 6A and 6B, and was quantified in the RMSE 
measurements: 2.35 in the direct-compression image difference and 1.46 in the 

30 sinogram-compression image difference. 

The head image reconstructed from a 23:1 sinogram compression is shown in 
Figure 7. For comparison, a direct-compression image was generated from the 
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original image with 20:1 compression ratio (Figure 7). Both images were profoxmdly 
degraded from the originai image. Marked increase in low-level noise was observed 
tn the sinogram-compression image, resulting in a substantial loss of the gray and 
white matter differentiation. On the other hand, the direct-compression head image 
5 showed more severe image degradation and the presence of high-level stmctural noise 
and streaky artifacts. 

An image from a spiral lung CT scan is shown with its spiral sinogram in 
Figure 8. At the same slice level, an image was reconstructed from 12:1 sinogram 
compression in a lung window (Figure 13 A) and a soft-tissue window setting (Figure 

10 13C). For comparison, a 12:1 direct-compression unage was generated in a lung 
window (Figure 13B) and a soft-tissue window setting (Figure 13D). The images 
displayed in the soft-tissue window setting show slightly more increase in low-level 
noise in the direct-compression than the sinogram-compression images, particularly in 
the heart and subcutaneous fat regions. 

15 In summary, the method of the present invention comprises taking the raw 

projection digital data from a CT scanner, stripping it out of the proprietary format 
typically provided by the CT scanner manufacturer if required and as desired for 
optimum results, and then compressing it including specifying a maximum allowable 
error of preferably about 8, and preferably at an amount that will achieve a 

20 compression of no greater than about 15:1, although the error could be different 

depending on the particular projection data. The compressed data may then be stored 
for archival purposes, along with an identifying index representing the particular 
views constructed and used by a medical professional in performing a diagnosis. 
Thus, the present method is effective not only to save the raw data, but also to save 

25 the necessary information required to reconstruct the particular images known to be 
accessed by a medical professional in making a diagnosis. Different compression 
schemes will have varying suitabihty for compressing sinogram data. Approaches 
based on wavelet decomposition are e3q)ected to be very effective for representing 
sinograms. Schemes that are based on recognized standards or are of low 

30 computational complexity are also desirable. The compression routine used is 

preferably that provided by JPEG-LS software, although other compression routines 
could be used so long as they are suited to rehably and effectively compress the data 



wo 01/76482 PCT/USOl/11510 

12 

arrays typically found in CT scans, for maximum benefit. Gtenerally, a compression 
routine is desirable should it be able to significantly reduce the size of the data set 
given the type of data patterns found in projection data (as known in the art), aUow for 
specifying sufficient information to archive those views considered during a 
5 diagnosis, and allow for reliable reconstruction of images from data restored from a 
compressed data set. These compression routines are preferably preformed on a 
computer, and even more preferably on the computer typically included as part of a 
CT scanner, as shown in Fig. 1 1 Avith the data flows as shown in Fig. 10. The 
inventive method also includes reconstructing the projection data from its compressed 
10 form, and reconstructing images not only as previously specified by a medical 

professional but preferably also as might be later used or specified by a different or 
even the same medical professional. The compressed data set may be stored in any 
suitable storage medium as known in the art including magnetic or optical disk, tape, 
etc. 

15 Various changes and modifications to the invention may be made as known to 

those skilled in the art without departing from the spirit or scope of the invention. 
The foregoing description of the preferred embodiment should be viewed as 
illustrative and not exhaustive. The invention should be considered as being limited 
solely by the claims appended hereto and their legal eqmvalents. 
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